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FIRST ORDER QUANTIFIERS IN MONADIC SECOND ORDER LOGIC 
H. JEROME KEISLER AND WAFIK BOULOS LOTFALLAH 


Abstract. This paper studies the expressive power that an extra first order quantifier 
adds to a fragment of monadic second order logic, extending the toolkit of Janin and 
Marcinkowski [JM01]. 

We introduce an operation exists,(S) on properties S that says “there are n compo- 
nents having S”. We use this operation to show that under natural strictness conditions, 
adding a first order quantifier word u to the beginning of a prefix class V increases the 
expressive power monotonically in u. As a corollary, if the first order quantifiers are not 
already absorbed in V, then both the quantifier alternation hierarchy and the existential 
quantifier hierarchy in the positive first order closure of V are strict. 

We generalize and simplify methods from Marcinkowski [Mar99] to uncover limitations 
of the expressive power of an additional first order quantifier, and show that for a wide 
class of properties S, S cannot belong to the positive first order closure of a monadic prefix 
class W unless it already belongs to W. 

We introduce another operation alt(S) on properties which has the same relationship 
with the Circuit Value Problem as reach(S) (defined in [JMO01]) has with the Directed 
Reachability Problem. We use alt(S) to show that HT, Z FO(Un), Un Z FO(A,), and 
An+1 Z FOB(=,), solving some open problems raised in [Mat98]. 


§1. Introduction. This paper studies the expressive power that an extra 
first order quantifier adds to a fragment of monadic second order logic. 

Second order logic embodies many of the outstanding open problems in com- 
plexity theory. In [Fag74] Ronald Fagin showed that the class NP coincides with 
the class of properties expressible by existential second order sentences. Thus 
NP = co-NP if and only if the class of existential second order sentences is closed 
under negation. Stockmeyer [Sto77] subsequently extended Fagin’s Theorem and 
showed that the polynomial hierarchy coincides with the second order quantifier 
alternation hierarchy, thus translating to logic the problem of the strictness of 
the polynomial hierarchy. 

These hierarchy problems have been hard to attack. Fagin suggested studying 
monadic second order logic (MSO), a simplified fragment of full second order 
logic, in which second order quantifiers are only allowed over unary relations, i.e. 
subsets of the underlying universe. MSO was indeed tractable. In [Fag75] Fagin 
himself used Ehrenfeucht-Fraissé games to show that existential MSO (called 
monadic NP) is not closed under negation, thus separating monadic NP from 
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monadic co-NP. Matz and Thomas [MT97] showed that the monadic quantifier 
alternation hierarchy is strict. In particular, they showed that ©}, C B(=,) C 
An+i C 'n4i1, where B denotes Boolean closure of ©,. Their argument was 
based on growth rates of definable functions. In [Mat98] Matz used the growth 
argument to investigate the role of the positive first order closure in the monadic 
alternation hierarchy. Among other things, he showed that, on the class of finite 
graphs, Any2 Z FOB(Sn), FO(En)M FO(Mn) Z B(En), and FO(En41) 0 
FO(In+i) Z FOB(S,), where FO denotes the positive first order closure, and 
FOB denotes the first order/Boolean closure. 

Ajtai, Fagin, and Stockmeyer in [AFS98] and [AFS00] proposed closed monadic 
NP, in which first order quantifiers are freely mixed with monadic second order 
existential quantifiers, as the “right” monadic version of NP. They posed the 
problem of whether the corresponding hierarchy is strict. Marcinkowski [Mar99] 
showed that Directed Reachability is not in FO(S1), answering a question in 
[AFS98]. The tools of [AFS00] and [Mar99] were put in an abstract form by 
Janin and Marcinkowski in [JM01], to study the expressive power of fragments 
of MSO defined by prefix classes. 

A (monadic) prefix class is a regular expression V built from the first order 
quantifiers v,3, monadic second order quantifiers V,4, and the Boolean closure 
operator @. The logic L(V) is the set of formulas built from words in V and 
finite conjunctions and disjunctions. 

In [JMO1] two operations on graph properties S were defined, bool(S) and 
reach(S). They call a prefix class nontrivial if it ends in (v3®)* and contains 
either an V* or an d*. They proved the following results for nontrivial prefix 
classes V and W: 

1) If both S and its complement are expressible in L(V), but S is not express- 
ible in L(W), then bool(S') is expressible in L(33V) but not in B(L(W)). 

2) If S is expressible in L(V) but not in L(W), then reach(S) is expressible 
in L(AvvV) but not in FO(L(W)). 

In this paper we introduce two more operations, exists,,(S) and alt(S), and 
prove the following results for arbitrary prefix classes V and W: 

3) If S is expressible in L(V) but not in L(W), then for any (n — 1)-tuple u of 
first order quantifiers, exists,(S) is expressible in L(waV) but not in L(uv*W). 
(Lemma 3.2). 

4) If V contains vv and v3, and S is expressible in L(V) but not in L(W), 
then alt(S) is expressible in L(AvV)N L(V aV) but not in FO(L(W)). (Lemmas 
6.6 and 6.7). 

The operation exists,,(.S), introduced in Section 3, says “there are n com- 
ponents having property S”. 3) above shows that this operation introduces an 
“existential hardness”, so that adding a word ua of first order quantifiers before a 
prefix class V increases the expressive power monotonically in u. As a corollary, 
if the first order quantifiers are not already absorbed in V, then both the first 
order quantifier alternation hierarchy and the first order existential quantifier hi- 
erarchy inside FO(L(V)) are strict. This improves a theorem in [KW73], where 
it is shown that (when V is empty) any two distinct first order quantifier words 
v,w express different sets of properties. 
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In Section 4 we simplify the proof of the result in [Mar99] that Directed Reach- 
ability is not expressible in FO(X,). The method is extended in Section 5 to 
show that for a wide class of properties S and for any monadic prefix class W, S' 
cannot be expressed in FO(L(W)) unless it is already expressible in L(W), and 
S cannot be expressed in FOB(L(W)) unless it is already expressed in B(L(W)). 

In Section 6 we apply the ideas of Section 5 to define the operation alt and 
prove the result 4) above. alt has the same relationship with the Circuit Value 
Problem as reach has with the Directed Reachability Problem. The flexibility 
of alt allows us to strengthen some results and solve some open problems in 
[Mat98]. In particular, we show that I, Z FO(Un), Un Z FO(I,), and A,41 Z 
FOB(S,). 


§2. Basic definitions. For simplicity we only consider vocabularies contain- 
ing one binary predicate FE (for edge) and possibly several unary predicates and 
constant symbols. As usual, the logic has the equality symbol, =, as a built-in 
relation. We will not consider the case of logics with other built-in relations, 
such as linear order. All of the results in this paper hold for the class of finite 
models as well as the class of all models. That is, you can choose either one of 
the following two options at the outset, and stay with that option throughout 
the paper. 

Finite option: Models are finite directed graphs with colors and distinguished 
vertices. 

Infinite option: Models are arbitrary directed graphs with colors and dis- 
tinguished vertices. 

We use C for inclusion, C for strict inclusion, and ¢ for the negation of 
inclusion. 

In this paper we consider only monadic second order extensions of first order 
logic. By a logic we will mean a set of monadic second order formulas which is 
positive Boolean closed, that is, closed under finite conjunctions and disjunctions. 
As usual, sentences are formulas with no free variables. 

To clarify the different roles of universal and existential quantifiers, we assume 
that all negation signs are pushed inside. We shall sometimes view formulas as 
trees with nodes labelled by conjunction signs, disjunction signs, first order and 
monadic second order quantifications, and leafs labelled by literals, i.e. atomic 
and negation of atomic formulas. 

Following [JM01], a pattern is a word over the alphabet {v,3,V,3,®}, where 
Vv, are first order quantifiers, V ,3 are monadic second order quantifiers, and @ 
is the Boolean closure operator. 


We let 7 be a finite signature which contains at least one binary relation 
symbol and remains fixed throughout. For each signature 7 and each pattern w, 
we define the logic L(w) supported by w by induction as follows. 


e The empty word supports the set of all quantifier-free MSO formulas with 
signature T. 

e L(vw) is the positive Boolean closure of the set L(w) U {vay : y € L(w)} 

e L(aw) is the positive Boolean closure of the set L(w) U {ary : y € L(w)}. 

e L(V w) is the positive Boolean closure of the set L(w)U{V Xv: y € L(w)} 
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e L(Aw) is the positive Boolean closure of the set L(w) U {AX¢y: y € L(w)}. 
e L(@w) is the Boolean closure of L(w), so L(@w) = B(L(w)). 


EXAMPLE 1. The prenex sentence (vx)(ay)(V Y)(E(a,y) AY («)) is supported 
by the pattern vaV , while the sentence (v x)((ay)E(a, y)A(VY)Y(a)) is supported 
by the pattern vaV as well as by vV4. 


We shall freely make use of regular expressions which do not contain union 
to denote classes of patterns, which we will call prefix classes. The logic L(W) 
supported by a prefix class W is defined to be the union of the logics L(w), w € W. 
Note that each L(W) is positive Boolean closed. 


EXAMPLE 2. The positive first order closure of a logic L(V) is the logic 
FO(L(V)) = L((v3)*V). 

First order logic is the logic FO = L((v3)*). 

Monadic NP is the logic Sy = L(A*(v3)*) = L((Aa)*(v3)*). 

Monadic co-NP is Tl, = L(V *(v3)*) = L((Vv)*(va)*). 

Closed monadic NP is L((Av3)*). 

Closed monadic co-NP is L((Wva)*). 

Magi = L(A*V) = L((43)*V) where I, = L(V), 

Indi = L(V*V) = L((Vv)*V) where Sp, = L(V), 

An = Sn Nn. 


We do not allow regular expressions with unions, such as (4 U v)?, in the 
definition of a prefix class, because we do not have a corresponding Ehrenfeucht- 
Fraissé game in Theorem 2.2 below. 

The reader should be warned that for certain prefix classes W, it is not true 
that for every formula in L(W) is equivalent to a prenex formula in L(W), as 
the following example shows. 


EXAMPLE 3. The sentence 


avay(E(x,y) \ E(y,@)) A aaay(E(a,y) \>E(y, «)) 
is supported by the pattern 33, but there is no equivalent prenex sentence which 
is supported by 33. To support an equivalent prenex sentence one must go to the 
pattern 3343. 


In this paper, it will always be understood that S denotes a property of (en- 
riched) graphs, and that V and W denote prefix classes. 

We shall identify a class of MSO sentences with the class of graph properties 
expressible by those sentences, where a graph property is the set of graphs having 
this property. Thus we write S € L(W) if the property S is expressible by a 
sentence in L(W). 

The complement of a graph property S is the class S of all graphs that do not 
have S. The dual w of a pattern w is the pattern obtained by switching 3 with v 
and 4 with V. The dual of a prefix class W is the class W = {w: w € W}. Note 
that S € L(W) if and only if S € L(W). Thus, when we get an expressibility 
statement about S or W, we also get for free a dual statement about S or W. 

Since our convention is to push negations inside, the Boolean closure B(L(W)) = 
L(®W) of a logic is defined as the positive Boolean closure of L(W) U L(W). 
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We recall from [JM01] that each pattern w naturally corresponds to an Ehrenfeucht- 
Fraissé game between two players (Spoiler and Duplicator) played on a pair of 
(colored) graphs with distinguished points. For brevity, we will call a colored 
graph with distinguished points an enriched graph. Consider a pair A, 6 of en- 
riched graphs with the same signature. 


The game w on the pair A,B proceeds according to the following rules: 
w =3u (w=vv): 
(1) Spoiler chooses a vertex c € A (d € B). 


(2) Duplicator chooses a vertex d € B (c € A). 
(3) Spoiler and Duplicator play v on the enriched pair (A, c), (B,d). 


w = dv (w = Vv): The game proceeds according to the following rules: 
(1) Spoiler chooses a subset CC A (DC B),. 
(2) Duplicator chooses a subset D C B (C C A). 
(3) Spoiler and Duplicator play v on the enriched pair (A, C), (6, D). 
w= Ov: 
(1) Spoiler chooses either the pattern v or the dual pattern J. 
(2) Spoiler and Duplicator play the pattern chosen by Spoiler on A, B. 


w =@ (the empty word): Game is over. 

Duplicator wins the game @ on A,B if and only if A and B satisfy the same 
atomic sentences. (In other words, the tuples of distinguished vertices a, b are ei- 
ther empty or the mapping a; + 6; is an isomorphism from the colored subgraph 
generated by a to the colored subgraph generated by b.) 


We write A —, 6 if Duplicator wins the game w on the pair A, 6; otherwise 
we write A Aw B. 


REMARK 2.1. Let v= W. 

(i) L(@v) = L(@w). 

(ti) A sy B if and only if By A. 

(tit) A Sew B if and only if Aw B and By A. 
(iv) A sew B if and only if B saw A. 


The following basic theorem clarifies the role of games (see, for example, 
[EF99]). It depends on the fact that the logic L(w) is positive Boolean closed. 


THEOREM 2.2. S € L(w) if and only if for all (enriched) graphs A and B, 
AeéS and Ay B implies BE S. 4 


Thus, to show that some property S ¢ L(w), we construct two graphs A € 
S,B ¢ S and show that A —,, 6. Quite often, these graphs will be built from 
smaller graphs by means of some operations. Here is a simple lemma which is 
often used without explicit mention when working with such graphs. 


LEMMA 2.3. If Ay B then A —, B for each substring v of w. 


PRooF. Duplicator can win the v game by playing the w game, but choosing 
imaginary moves for Spoiler and making imaginary responses at times which are 
in w but not in v. 4 


The following definition and easy lemma are essentially in [JMO1]. 
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DEFINITION 2.4. If A is a graph, cone(A) is the graph formed by adding a 
new vertex r, called the root of cone(A), and adding a directed edge from r to 
each vertex of A. 

If Ay,...,An are graphs, Ay W...W A, denotes the union of disjoint copies 
of cone(A;) through cone(A,). We also writenA = AW...WA (n times). 


LEMMA 2.5. (i) If Ay B, then cone(A) + cone(B). 
(i) If m is a permutation on {1,...,n} and Aj zw By) for alli=1,...,n, 
then (Ai H...W An) sw (Bi... B,). 


ProoF. Like many results later on, this lemma is proved by induction on the 
pattern w. We mention here that in order to carry out such inductions, one 
needs to prove an analogous result for enriched graphs. For part (ii), define 
(C,W...WC,) when C1,... ,C, are disjoint enriched graphs whose signatures all 
have the same set k of colors but have disjoint sets d; of distinguished elements. 
The signature of (Cj W...WC,,) has the set & of colors again, and its set of 
distinguished elements is the union d,U---Ud,. The interpretation of each color 
in the sum is the union of its interpretations in the Cj. 

We remark that this proof, like many later proofs in this paper, needs Lemma 2.3 
to deal with the fact that a first order move is made in just one A; or B; at a 
time. 4 


§3. The power of a new first order quantifier. In this section we will 
define a simple operation on graph properties and inductively apply it to show 
that if L(V) # L(aV) and L(V) # L(vV), then the expressive power of the logics 
L(uV) increase monotonically as the string of first order quantifiers wu grows. 


DEFINITION 3.1. [fn > 0, exists,(S) is the colored graph property with two 
new colors green and white, saying that the graph has at least n components, 
each of which is a cone of a white graph which has property S and has a green 
root. 


The reader may wonder why the new colors are introduced in this definition, 
since the root of a cone is already distinguished as the unique vertex with indegree 
zero. The reason is that it takes a universal quantifier to say that a vertex has 
indegree zero, and the new colors avoid this quantifier. Note that Part (i) of the 
next lemma would be false without the new colors. However, it can be easily 
fixed if we require that all words in V contain at least one universal first order 
quantifier. 


LEMMA 3.2. Let n > 0, and suppose that u € (aU v)"71. 
(i) If S € L(V), then exists,(S) € L(uav). 
(i) If S € L(W), then exists,(S) ¢ L(uv*W). 


ProoF. (i) The proof is by induction on n. The result is clear for n = 1. 
Assume the result holds for n. Let 7 be a sentence in L(uaV) which expresses 
exists,(S). Let 2 be a new variable and let 6(a) be the formula obtained from w 
by replacing each existential quantifier from the outer us by the corresponding 
relativized quantifier sy 4 x. Then the sentence v x(x) expresses exists,+1(S), 
and belongs to L(vuaV). The sentence 


FIRST ORDER QUANTIFIERS IN MONADIC SECOND ORDER LOGIC 7 


3a[x is a green root of a white S-component and 0(x)| 
also expresses exists,+41(5) but belongs to L(suaV). 

(ii) Let w € W. Since S ¢ L(W), there are two graphs A € S, B ¢ S, such 
that Duplicator wins w on A,B. Given m, we need to show that exists,(S) ¢ 
L(uy™w). Let C= nAW (n+ m-—1)B and D= (n—1)AW(n+m)B. Then C € 
exists,(S) and D ¢ exists,(S). We describe a winning strategy for Duplicator 
in the game uv"™w on C,D. 

In the u part of the game, Duplicator’s first n—1 moves exactly mirror Spoiler’s 
first n — 1 moves. This is possible because whenever Spoiler moves in a new 
component on one side, there will still be a new component of the same kind on 
the other side. Spoiler’s next m moves will choose vertices of D. These moves 
can also be mirrored by Duplicator, because for each new component in D there 
will still be a new component of the same kind in C. At this point there is at least 
one A-component Ag of C and one B-component By of D which have not been 
pebbled. The game position is given by a pair of enhanced graphs (C,c), (D, d) 
which satisfies the hypotheses of Lemma 2.5, where the permutation 7 matches 
Ap with Bo, and matches each other component of C with an exact copy in D. 
By Lemma 2.5, Duplicator can win w on (C,c),(D,d), and thus can win the 
original game uv” w on C, D. 4 


The next corollary follows at once. Part (b) is the dual of part (a). 


COROLLARY 3.3. Let V and W be prefix classes such that L(V) Z L(W). 
Then for each m > 0, 

(a) (\{L(uaV) : ue (aU v)™} Z U{L(uy* W) su € (aUv)™}. 

(b) (MU L(V): ue (3Uv)™} Z U{L(us* W): we (auv)y™}. 4 


Note that part (a) says that there is a single property, namely exists,,(S), 
which is expressible in every one of the logics L(uaV), u € (aUv)™, and is not 
expressible in any of the logics L(uv* W), u € (aUv )”. Following our convention, 
we do not write L((sUv )’v* V) because the expression (3Uv )”v* V has a union, 
and does not correspond to an Ehrenfeucht-Fraissé game in Theorem 2.2. 

Corollary 3.3 can be sharpened as follows. Given a first order quantifier word 
u, let F'(u) be the set of all prefix classes obtained by replacing each 3 in u by 
either 3 or v*, and replacing each v in u by either v or 3*. For example, after 
absorbing single quantifiers into starred quantifiers and omitting duplicates, we 
have 

F(ava) = {ava,v*s,a*,av*,v*a"*,v*,a’v7*,v*a*v"}, 
F(vaa) = {v33,3",v*s,vav’*,a’*v*3,a*v*,v*}. 


COROLLARY 3.4. Suppose that L(V) Z L(W), and let m > 0, u € (aUv)™, 
and U € F(u). Then: 

(a) L(uaV) Z L(Uv* W). 

(b) LiuvV) Z L(Us* W). 


PROOF. We prove (a). The proof is by induction on m. When m = 0 the 
result follows from Corollary 3.3. Assume the result holds for all n < m, and let 
u € (aUv)™ and U € F(u). If U = u then the result follows from Corollary 3.3. 

Suppose U # u, and 3 is replaced by v™* at the first place where U differs from 
u. Then we have u = sat and U = sv*T where T € F(t). (One or both of s,t 
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can be empty). Then t has length less than m, and by inductive hypothesis, 
L(taV) Z L(Tv*W). Let V’ = taV,W’ = Tv*W. Then L(V’) Z L(W’). Using 
Corollary 3.3 again, we have L(saV’) Z L(sv*W’). But saV’ = sataV = uaV 
and sv*W’ = sv*Tv*W = Uv*W, so L(uaV) Z L(Uv*W) and the induction is 


complete. 
The proof is similar in the case that v is replaced by 3* at the first place where 
U differs from wu, so (a) holds in all cases. (b) is the dual of (a). 4 


Now let V = W, and consider the first order quantifier alternation hierarchy 
over V. This hierarchy is defined by 


Don(V) = L((a*v*)"V), Langa (V) = L((a*w*)"3*V), 


TI, (V) is the dual of ©°,(V), and A®, (V) = 5° (V) NH, (V). The union of each 
of these hierarchies is the logic FO(L(V)) = L((v3)*V), the positive first order 
closure of L(V). 


COROLLARY 3.5. (i) If L(aV) # L(V) and L(vV) # L(V), then the hierarchy 
is strict, that is, the logics ©°(V),U°(V), A°(V), n=1,2,... are all different. 
L( 


(ii) If L(aV) 4 L(V #4 and oe V), then for each n > 0, 
(itt) If L(aV) = ae 7 and ah V) # L(V), then for each n > 0, 
(iv) If L(aV) = L(V) = L(vV), then the hierarchy collapses to L(V). 4 


Proor. All the inclusions are clear. We use Corollary 3.4 to prove the 
strict inclusions. Suppose L(aV) 4 L(V). Put V’ = aV, so that L(V’) Z 
L(V). Let u = (av)""1, so U = (v*a*)""! € F(u) in the ee of 3.4. 
Then L(uaV’) Z L(Uv*V). We have L(uaV") = L(uaaV) C XS, 1 (V) and 
TeV ) SG, (V5 80 
Similarly, starting from t = vu we get 


To,(V) Z U3n(V). 
Part (ii) now follows, and we get (iii) by duality. Finally, (i) is proved by putting 
all of these non-inclusions together. 4 


In the next corollary we consider the hierarchy W,3W,3?W,..., which is a 
refinement of ©9(W), and W,vW,v?W,..., which is a refinement of II?(W). 


COROLLARY 3.6. (i) If L(W) Cc L(aW), then L(a"W) c L(a"*!W) for each 
n. 


(ii) If L(W) C L(vW), then L(v™W) c L(v"*!W) for each n. 4 


PRooF. (i) Suppose L(W) Cc L(aW). By Corollary 3.3 with V = aW, we 
have L(sW) Cc L(asv) = L(asaw). If L(a3W) = L(sW), then we would also 
have L(333aW) = L(aaw), contradicting L(aW) Cc L(a3sW). Therefore we 
must have L(aW) Cc L(sasW). The desired result now follows by induction. Part 
(ii) is the dual of (i). 4 
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Finally, we prove that if adding one first order quantifier increases the ex- 
pressive power of L(W), then the expressive power of L(uW) is almost always 
increased by adding one quantifier anywhere within u, and is always increased 
by adding two quantifiers anywhere within wu. 


THEOREM 3.7. Suppose that L(W) Cc L(aW) and L(W) Cc LivW). Let tue 
(aUv)* be two first order quantifier words such that: 

(a) t is a proper substring of u, and 

(b) u is not tv or ta. 
Then L(tW) Cc L(uW). 


Proor. Let V = aW and Z =vW, so L(W) Cc L(V) and L(W) Cc L(Z) by 
hypothesis. 

Suppose first that |u| = |é} +1. Then ¢t = rs and u = rqs where gq € {v,3}. 
We give the proof when g = 3. The case g = v is similar. By hypothesis (b), the 
string s is nonempty, and is not of the form 3”. We may also position q at the 
right end of an 3-block in u, so that s begins with an v. 

Suppose s = vy. Then uW = ravW = r3Z, and by Corollary 3.3, L(uW) = 
L(raZ) Z L(rv*W), so L(tW) = LirvW) Cc L(uW). 

Now suppose |s| = 2. Then either s = va or s = vv. If s = v3, then 
uW = ravaW =ravV. By Corollary 3.3, L(uW) = L(ravV) Z L(rv*3* W), 
so L(tW) = L(rvaW) Cc L(uW). If s =vv, then uW = ravvW = rav Z, and 
L(uW) = L(rav Z) Z L(rv* 3*W), so L(twW) = LirvvW) Cc L(uW). 

We next suppose that |s| > 2. Then either s = 43,8 = wav,s = wv3, 
or s = xzvv for some nonempty string x which starts with an v. Form X by 
replacing each v in x by 3% and replacing each 3 in x by v*. Then X starts with 
an 3*. 

If s = x33, then by Corollary 3.4, 


L(uW) = L(ravaaw) = L(rarav) Z Lirv* Xv*W), 


and L(tW) = L(raaaw). By checking all cases, one can see that the string 33 
fits inside v*.X; each block in x goes to the preceding block in v*X, and the final 
33 goes to the last 3* in X. Therefore L(tW) Cc L(uW). 

If s = xav, then Corollary 3.4 gives 


L(uW) = L(rarvavW) = L(raxaZ) Z L(rv* Xv* W), 


and L(tW) = L(raavW). This time xv fits inside v* Xv*, and again L(tW) C 
L(uW). The cases s = eva and s = avv are similar. 

Finally, we suppose that |u| > |¢|/+2. By adding terms to t we may assume that 
|u| = |t|+2. We need only consider the cases that u = ta3 and u = tvv, because 
in all other cases we can add one more term to ¢ and satisfy hypotheses (a) and 
(b). If w = tas then by Corollary 3.3 we have L(uW) = L(taV) Z L(tv* W), so 
L(tW) Cc L(uW). The case u = tvv is similar. This completes the proof. 4 


QUESTION 3.8. Is Theorem 3.7 still true without hypothesis (b)? 


For example, do the hypotheses of Theorem 3.7 imply that L(av"W) Cc 
L(av"t!W), or even L(aW) c L(avW)? Note that we always have L(av”W) Cc 
L(av"*?W), so either L(av"W) Cc L(av™t?W) or L(av"*!W) c L(av"*?W). 
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QUESTION 3.9. Suppose L(aW) Z L(vW) and L(vW) € L(aw). If u,v € 
(aUv)* and L(uW) C L(uW), must u be a subsequence of uv? 


§4. Directed Reachability. In this section we will work with graphs A 
which have two distinguished points, called the source and the sink. A graph 
has the Directed Reachability property, DIR.REACH, if it contains a directed 
path from the source to the sink. 

It was shown in [Mar99] that DIR.REACH does not belong to the positive 
first order closure F'O(1) of ©; =monadic NP. In this section we give a simpler 
proof of this fact. Since DIR.REACH is not in monadic NP, a fact shown in 
[AF90] and already used in the proof given in [Mar99], the desired result easily 
follows from the next lemma. 


LEMMA 4.1. Let W be a prefix class such that DIR.REACH ¢ L(W). Then 
DIR.REACH ¢ FO(L(W)). 


PrRooF. The hypothesis can be viewed as asserting that for each w € W, 
there exists two graphs A,B, such that A is in DIR.REACH, B is not, and 
Duplicator can win w on A, B. Let ag, a, be the source and sink of A, and bo, by 
be the source and sink of GB. It suffices to prove: 

(1) DIR.REACH ¢ L(vW). 

(2) DIR.REACH ¢ L(aw). 
(1 


) Let C=A+B6 and D=6+B. Here the “sum” of two graphs (A + B) is 
the graph obtained by connecting disjoint copies of A and B in “parallel”. That 
is, we first form the union of a copy of A, a disjoint copy of B, and two new 
vertices co (the new source) and c, (the new sink). Then we connect cp to both 
of the old sources ag, bo, and connect both of the old sinks a1, b; to c,;. The sum 
has the same signature as the original graphs, with just the two distinguished 
vertices Cg, C1. 

It is clear that C is in DIR.REACH and D is not. By Theorem 2.2, to prove 
that DIR.REACH ¢ L(vW) it is enough to show that C yD. 

Duplicator wins vw on C,D as follows. Spoiler picks a vertex d in one of the 
copies of B in D. Duplicator responds by picking the corresponding vertex c in 
the copy of 6 in C. Lemma 2.5 still holds for sums of the form A+ Bb, with a 
minor change in the proof to take care of the source and sink. It follows that 
(C,c) sw (D,d), and thus C yw D. 


(2) LetC = A-A+A-AandD=A-6+86.-4A. Here the “product” of 
two graphs (A- 8B) is the graph that results from connecting A and then B in 
“series” . 

The product A: B is like the sum A+ 6 except for the connections between 
the new and old distinguished vertices. For the product we connect co to ao, a1 
to bo, and by to Cj. 

Again, C is in DIR.REACH and D is not. By Theorem 2.2, it now suffices 
to prove that C 3, D. 

Duplicator can win aw on C,D with the following strategy. Spoiler’s first move 
cis in C. If c is in the left half of the product A-.A, Duplicator chooses the 
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corresponding vertex d in the left half of the product A-6 in D. It is easily seen 
that (A-A,c) sw (A- B,d), and again (C,c) sw (PD, d) and C 3y D. 

If c is in the right half of A-.A, Duplicator chooses d in the right half of the 
product 8-A in D, and we have (A-A,c) + (B-A,d), and again (C,c) —w (D, d) 
and C —TIw D. a 


The above proof fully utilizes the concept of connecting graphs in parallel and 
in series, which already appeared in [Mar99]. 


§5. Limits on the power of a first order quantifier. We will now extend 
the method of the preceding section to give a general method of proving that 
some property S is not expressible by a sentence in L(W). 

The idea is to have two operations on graphs (“addition” and “multiplication” ) 
that play the roles of connecting graphs in “parallel” and in “series” with respect 
to Property $, such that the addition operation is commutative, and a winning 
strategy for Duplicator is congruent with respect to both operations. 

We consider enhanced graphs with a fixed finite signature o, and let k be the 
number of distinguished vertices. Our main tool will be the general notion of 
a disjoint binary operation (A,B) +> Ax B, where A,B, and Ax B are graphs 
with signature a. Informally, a disjoint binary operation takes a disjoint union 
of A and B, adds new distinguished vertices c,,... ,c,, and adds edges between 
the old and new distinguished vertices in a way prescribed by a fixed graph Co 
called the connector. 


DEFINITION 5.1. A connector is a graph Co which has 3k vertices 
G1,.-- ,Qz,01,... , be, C1,--- Ck, Such that the distinguished vertices are c1,... ,Ck, 
none of the pairs (a;,a;) or (b;,b;) are edges of Co, and none of the vertices a; 
or b; are colored. 

The disjoint operation with connector Co is the binary operation (A,B) > AxB 
constructed as follows. 

Make copies of A and B with distinguished vertices a,,...,a,~ and bj,... , by 
respectively, such that A,B, and the set {c1,...,cxK} are disjoint. Then the set 
of vertices of Ax B is the union of the sets of vertices of A,B,Co, the set of 
edges of Ax B is the union of the sets of edges of A,B,Co, each color in Ax B 
is the union of its values in A,B,Co, and the distinguished vertices of Ax B are 
C1,--++ 5Ck. 

A disjoint operation * is commutative if and only if for all A and B, Ax B is 
isomorphic to Bx A. 


Note that a disjoint operation * is completely determined by its connector up 
to isomorphism. That is, if A is isomorphic to A’, and B is isomorphic to B’, 
then A * B is isomorphic to A’ * B’. Moreover, * is commutative if and only if 
the mapping which sends each a; to b; and fixes each c; is an automorphism of 
the connector Co. 


EXAMPLE 4. In the proof of Lemma 4.1, the sum A+B is a commutative dis- 
joint operation, and the connector is the graph with vertices {ag, a1, bo, b1, Co, C1}, 
distinguished vertices co,c1, and edges (co, ao), (Co, bo), (a1, €1), (01, C1). 
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In the same proof, the product A- B is a noncommutative disjoint operation. 
Its connector has the same vertices and distinguished points as above, but has 
the edges (co, ao), (a1, bo), (b1, €1)- 


EXAMPLE 5. In the trivial case that k = 0 (where the signature o has no 
distinguished vertices), there is only one disjoint operation. This operation, the 
disjoint union of a copy of A and a copy of B, is commutative and its connector 
is the empty graph. 


We remark that commutative disjoint operations are not possible for logics 
with built-in linear order (e.g.as in [Imm99]), where it is more difficult to obtain 
lower bounds on the expressive power of a fragment. 


PROPOSITION 5.2. Let - be a disjoint operation, + be a commutative disjoint 
operation, A, B andC be graphs with k distinguished vertices, and w be a pattern 
such that Ay B. Then: 

(i) 

(a) A-C sy B-C and 

(b) C- Aw C-B, 

(i.e. yw ts a congruence relation with respect to -.) 

(ii) 
(a) A+ A say AFB. 
(b) A+ Bay B+B. 
(iii) 
(a) (A: A) + (A> A) aw (A: B) + (B- A). 
(b) (A: B) + (B- A) vw (B-B)+(B-B). 


PROOF. (i) (a). Duplicator can win the w game on A-C,B-C as follows. 
When Spoiler moves in a copy of C or in the connector on either side, Duplicator 
moves in the same place on the other side. When Spoiler moves in the copy of 
A or B, Duplicator follows her winning strategy for the w game on A,B. The 
proof of part (i) (b) is similar. 

The proof of (ii) is essentially the same as the proof of Lemma 4.1. For (a), 
Spoiler must first move in a copy of B or the connector in the right side, and 
Duplicator moves in the same place in the copy of B or the connector in the left 
side. After that, Duplicator follows her w strategy. The proof of (b) is similar. 

For (iii), Duplicator’s first move must match Spoiler’s first move; if Spoiler 
moves in the left half of a product, so must Duplicator, and if Spoiler moves 
in a copy of A (6), so must Duplicator. After that, Duplicator uses her w 
strategy. 4 


Note that commutativity of the +-operation is vital in the proof of each Part 
of this lemma. The (possible) non-commutativity of the --operation is the reason 
behind the necessary complexity of Part (iii). 


THEOREM 5.3. Let - be a disjoint operation and + be a commutative disjoint 
operation. Suppose that 
(i) Whenever 
AES, BES, 
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we have 
A+BeES,B+BéS, 


and 
(A-A)+(A- AD ES, (A -B)4+(B- AES. 


Then for any prefix class W: 
(a) If S € L(W), then S € FO(L(W)). 
(b) If S ¢ L(W), then S ¢ FO(L(W)). 
(c) If S ¢é B(L(W)), then S ¢ FOB(L(W)). 


PROOF. (a) By Proposition 5.2, whenever A ~~ B, then 
(1) A+ Braww B+B, 
(2) (A- A) + (A: A) saw (A+B) + (B- A). 


Now suppose S ¢ L(W). For each w € W, S ¢ L(w), and by Theorem 2.2 there 
are graphs A€ S,B¢S such that A—, B. 

By hypotheses (i), the left hand sides of (1) and (2) belong to S while the 
right hand sides do not. Thus by Theorem 2.2 again, S ¢ L(vw) and S ¢ L(sw). 
Therefore S ¢ L(vW) and S ¢ L(3sW). It now follows by induction that S ¢ 
FO(L(W)). 


(b) This follows from Part (a) by a duality argument. If S ¢ L(W), then 
S ¢L(W). By Part (a), S ¢ FO(L(W)). Thus S$ ¢ FO(L(W)). 

(c) Suppose S ¢ B(L(W)). Let V = @W. Then B(L(W)) = L(V) and 
S¢ L(V). By Part (a), S ¢ FO(L(V)) = FO(L(W)). 4 


COROLLARY 5.4. Theorem 5.3 holds when the hypothesis (i) is replaced by the 
following simpler properties: 


(ti) A+ BES if and only if AGC S or BES. 
(itt) A-BES if and only if AGC S and Be S. 


PROOF. It is easily seen that (ii) and (iii) imply hypothesis (i) of Theorem 
5.3. 4 

Some simple (and typical) examples of properties having addition and mul- 
tiplication operations which satisfy Theorem 5.3 (i) are those dealing with the 
values of logical formulas, where addition and multiplication simply correspond 
to disjunction and conjunction. 

We use “rooted” colored graphs to encode propositional formulas, with a dif- 
ferent color for each proposition symbol and connective. By a graphical formula 
we mean a colored graph which encodes a formula in the following way. A graph- 
ical formula has one distinguished vertex, called the root, which has indegree 0 
and represents the main symbol in the formula. 

An atomic formula, which is just a predicate symbol by itself, is encoded by a 
single vertex which is a root with the corresponding color. The symbol V is blue, 
and the symbol / is yellow. Thus, the graphical formula encoding ¢ V w has a 
blue root with two outgoing edges pointing to (the roots of) graphical formulas 
encoding @ and w. Similar encoding can be done if the main connective is A or 


=a, 
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The colors serve as a convenient tool for classifying the vertices. However, 
they are not essential to the encoding mechanism and can be replaced if we wish 
by suitable “gadgets” connected to the classified vertices. 

We extend this encoding in the obvious way to stronger languages, such as 
FO, SO, or MSO. 

Now define the disjunction (conjunction) of graphical formulas A and B, de- 
noted by AV B (AA B), by forming the union of a copy of A, a disjoint copy 
of B, and a new root r, then coloring r blue (yellow), and connecting r to the 
roots of A and B. 


PROPOSITION 5.5. The operations of disjunction and of conjunction on graph- 
ical formulas are both commutative disjoint operations. = 


We next observe that the disjunction and conjunction operations behave like 
addition and multiplication in Theorem 5.3 for a very broad class of properties 
S. 


DEFINITION 5.6. Let L be a (positive Boolean closed) logic and M be any class 
of structures for L. A graph is in SAT(L, M) if and only if it encodes a sentence 
in L which is true in some structure in M. 


For example, if L is propositional logic and M/ is the class of all propositional 
structures, then SAT(L, M) is the Satisfiability Problem for propositional logic. 

If L is first order logic with equality and the constant symbol TRUE, and M 
is the class of L-structures with universe {TRUE, FALSE}, then SAT(L, M) 
is a version of the Quantified Satisfiability Problem for propositional logic. 

Note that if the set M contains only one structure, then the stronger conditions 
Corollary 5.4 (ii), (iii) hold for SAT(Z,M). For example, if L is propositional 
logic and M = {A} for a particular assignment A of truth values to propositional 
symbols, then SAT(L, M) is the Circuit Value Problem for A. 


PROPOSITION 5.7. Let L be a logic, M be a class of structures for L, and let 
S=SAT(L,M). If A€S and BES then 

()AVBES and BVBES. 

(ti) ANAES and (AAB)V (BAA) ES. 


PROOF. (i) is obvious. For (ii), note that if A € S then AA A € S;, and if 
BéSthn AABES. 4 


Now putting Propositions 5.5 and 5.7 together with Theorem 5.3, the following 
corollary is immediate. 


COROLLARY 5.8. Let W be a prefix class, L be a logic, and M be any class 
of structures for L. Then SAT(L,M) does not belong to FO(L(W)) unless it 
already belongs to L(W). 4 


§6. A sharper version of reach. In [JMO01] Janin and Marcinkowski defined 
the operation reach on properties. (The following definition, suggested to them 
by Sockmeyer, is somewhat simpler than their original definition.) 

For any property S of graphs with a distinguished vertex, reach(S) holds for a 
graph G with distinguished vertices s and t if and only if there is a directed path 
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from s to t such that for every x on this path and every y not on the path with 
E(x, y), the connected component of G\ {x} with y as a distinguished vertex has 
the property S. 

Recall from the introduction that a prefix class is nontrivial if it ends with 
(va@)* and contains either an V* or an 5*. They showed the following: 


Lemma 6.1. /JM01] If V and W are nontrivial prefir classes and S' is a prop- 
erty of graphs that belongs to L(V) but not to L(W), then reach(S') belongs to 
L(AvvV) but not to FO(L(W)). 4 


Their proof was an adaptation of Marcinkowski’s proof in [Mar99] of the fact 
that Directed Reachability does not belong to the positive first order closure of 
monadic NP. 

In this section we will apply the results of the previous section to define an 
analog of reach called alt, for which a sharper lemma can be proved. 


We consider only enriched graphs with a distinguished vertex r (called the 
root) and the disjoint colors blue, yellow, green and white. Note that having 
four colors may be thought of the result of having two possibly overlapping 
unary predicates B(x) and Y(a), where x is blue means B(x) A AY(zx), «x is 
yellow means Y(xz) \- B(x), x is green means B(x) AY (x), and x is white means 
aB(a) A7AY (a). 

We call an enriched graph a potential tree if the following holds: 

1) Each vertex has exactly one of the colors blue, yellow, green, or white. 

2) r is nonwhite and has indegree 0. 

3) Each nonwhite vertex has indegree at most 1. 

4) Each white vertex has an incoming edge from at most one green vertex. 

5) A blue or yellow vertex cannot have an outgoing edge to a white vertex. 

6) Green and white vertices have outgoing edges only to white vertices. 

7) If a,b are white vertices and there is an edge from a to b, then for each 
green vertex g, there is an edge from g to a if and only if there is an edge from 
g to b. 


Thus in a potential tree, the connected component of the root is a tree with 
blue and yellow vertices possibly leading to green vertices, which then point to 
disjoint white graphs. 


LEMMA 6.2. The property Potential Tree is in L(vvv) N L(vav). 


ProoF. It is clear that each condition 1-7 can be expressed in L(vvv). For 
L(vav), note that 4 and 7 can be replaced by: 

4a) For each white a there exists g such that for each green h, E(h,a) implies 
h=4g. 
7a) For each white a, if there is a green g with E(g,a) then there is a green g 
such that E(g,a) and for all b, E(a,b) implies E(g, b). 4 

For each green vertex g in a potential tree, the fruit of g is (the subgraph 
generated by) the set G = {x : E(g,x)}. g is then called the root of the fruit 
G. Note that each fruit G is a white graph (if the reader finds white fruits to 
be unappetizing, he can replace them by white flowers). Any vertex which is 
connected to a vertex in G by an edge belongs to G U {g}, and any two distinct 
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fruits are disjoint. Also, the graph G U {g} with g as a distinguished vertex is 
cone(G). 

In a potential tree, an alternating tree is a set of vertices T satisfying: 

Ire. 

2) If « € T and z is blue (indicating an “or” node), then there exists a vertex 
y such that E(x,y) and y € T. 

3) Ifa € T and z is yellow (indicating an “and” node), then all vertices y with 
E(x,y) belong to T. 

4) T has no white vertices. 


A co-alternating tree is a set T satisfying Conditions 1-4 with “blue” and 
“vellow” interchanged in Conditions 2 and 3, thus interchanging the roles of 
“or” and “and” nodes. 

By a fruit of an alternating (co-alternating) tree T we mean the fruit of a 
green vertex which belongs to T. 

The connected part of an alternating (co-alternating) tree T is the connected 
component of T that contains the root r. Note that for any alternating (co- 
alternating) tree T’, the connected part of T is also an alternating (co-alternating) 
tree. 


DEFINITION 6.3. alt(S) is the property of enriched graphs saying that the 
graph is a potential tree that contains an alternating tree T’ such that each fruit 
of T is an S-graph. 


A potential tree is said to be trivial if the root r is green. In a trivial potential 
tree, the root r has a fruit G, the connected component G U {r} is cone(G), and 
T = {r} is both an alternating and a co-alternating tree. Thus for any graph 
property S, cone(G) has alt(S) if and only if G € S. Here is a simple application 
of Lemma 2.5. 


LEMMA 6.4. Let W be a prefix class. If S ¢ L(W) then alt(S) ¢ L(W). 


Proor. Let w € W. By Theorem 2.2, there are A € S and B ¢ S such that 
A wy B. By Lemma 2.5, cone(A) wy cone(B). As noted above, cone(A) € 
alt(S) and cone(B) ¢ alt(S'). Then by Theorem 2.2 again, alt(S) ¢ L(W). — 


LEMMA 6.5. Let A be a potential tree. A does NOT have alt(S) if and only if 
A has a co-alternating tree T such that no fruit of T is an S-graph. 


Proor. Let n(A) be the cardinality of the set of nonwhite vertices in the 
connected component of the root r. The proof is by induction on n(A). In this 
proof, it will be understood that all trees mentioned are connected. 

For the basis step, suppose n(A) = 1, and let T = {r}. If r is an “and” vertex, 
then T is an alternating tree with no fruits, so A has alt(S), and there are no 
co-alternating trees. 

If r is an “or” vertex, then T is a co-alternating tree with no fruits, and there 
are no alternating trees, so A does not have alt(S). If r is green, then T is both 
alternating and co-alternating, and A has alt($) if and only if the fruit at r is 
an S-graph. 

Now suppose that n(A) > 1 and the lemma holds for every potential tree B 
such that n(B) < n(A). Then r is either an “and” vertex or an “or” vertex, and 
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has indegree 0 and outdegree k > 0. Let s1,...,s,% be the vertices connected to 
r by an edge. For each 7 < k, let B; be the enhanced subgraph of A with root s; 
consisting of all vertices which can be reached from s; by a directed path. Then 
n(B;) < n(A). 

Suppose first that r is an “and” vertex. Note that T is a (connected) alter- 
nating tree in A if and only if TM B; is an alternating tree in 6; for all i < k. 
Therefore A has alt(S) if and only if B; has alt(S) for all i < k. Moreover, for 
each i < k, a set U C B; is a co-alternating tree in 6; if and only ifU U{r} isa 
co-alternating tree in A. Thus A has a co-alternating tree such that no fruit is 
an S-graph if and only if for some i < k, 6; has a co-alternating tree such that 
no fruit is an S-graph. Using the inductive hypothesis, it follows that A does 
not have alt(S) if and only if A has a co-alternating tree such that no fruit is an 
S-graph. 

The proof when r is an “or” vertex is similar. 4 


LEMMA 6.6. Let W be a prefix class. Suppose that either S ¢ L(W) or 
alt(S) ¢ L(W). Then alt(S) € FO(L(W)). 


PRooF. Recall that FO(L(W)) = L((va)*W). By Lemma 6.4, alt(S') ¢ 
L(W). alt(S) has addition and multiplication operations on graphs, which con- 
nect two graphs with “or” and “and” roots respectively. The hypotheses of 
Theorem 5.3 are easily verified for these operations, and the result follows. 4 


LEMMA 6.7. Suppose V is a prefix class such that some v € V contains vv 
and V3 as substrings. For each natural number n: 

(i) If S € L(V), then alt(S) € L(AvV) ON L(WaV). 

(ii) If S € L(A"V), then alt(S) € L(A" t'vV). 

(iti) If SE L(V"V), then alt(S) € L(V"t13V). 


PRooF. (i) follows from (77) and (itz) if we let n = 0. 

(ii) We call A good if it has property S, and bad otherwise. Since S € L(A”V), 
there is some v € V such that for all good A and bad B, Spoiler wins 3”v on 
A,B. Since V is directed, we may assume that v also contains the substrings 
VV,Va. 

Let C € alt(S) and D ¢ alt(S). We first show how Spoiler wins 3”t'vv on 
C,D. If D is not a potential tree, then by Lemma 6.2, Spoiler wins by using the 
moves VVvV in vv to point out the problem in D. 

Otherwise, he starts by choosing in C an alternating tree X such that every 
fruit of X is good. Duplicator must respond with an alternating tree Y in D. 
(If she doesn’t, Spoiler can win the game using vv against a bad “and” vertex, 
and v3 against a bad “or” vertex). Then at least one fruit B of Y is bad. Since 
all fruits of X are good, Spoiler can combine his winning strategies to play 4” 
in each good fruit of X against the bad fruit B of Y. In other words, in each 3- 
move, Spoiler plays the union of the subsets of the (good) fruits of X determined 
by his winning strategies against the fixed bad fruit B. (Note that, since C is 
a potential tree, fruits with different roots are disjoint.) Spoiler then uses the 
v-move to pebble the root of B. Duplicator must respond by pebbling the root 
of a (good) fruit G of X. Finally, Spoiler can now win the game by playing his 
winning strategy for v on the pair G, B. 
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(iii) We now show how Spoiler wins V"*t+3v on C,D. Suppose first that D 
is not a potential tree. Spoiler can use the V3 moves to simulate an vy move 
d € D by choosing the set Y = {d} C D and then choosing an element of 
Duplicator’s response X C C (if Duplicator chooses X empty, Spoiler can easily 
win by choosing d with a later ¥ move). v contains the substring vv, so again 
Lemma 6.2 shows that Spoiler can win by pointing out the problem in D. 

Otherwise, Spoiler starts by choosing in D a co-alternating tree Y such that 
every fruit of Y is bad. Duplicator must respond with a co-alternating tree X 
in C. (If X is not co-alternating, Spoiler wins as before, because v contains both 
vv and v3 as substrings). From this point we argue as in part (ii) to complete 
the proof. 


Let =n, Un, An be the levels of the monadic second order quantifier hierarchy 
(defined in Section 2). 


COROLLARY 6.8. Let n> 0, let L be any of the logics %,, U, or A,, and W 
be a prefix class. 

(i) If L Z L(W), then L Z FO(L(W)). 

(i) If LZ B(L(W)), then L Z FOB(L(W)). 


PROOF. (i) For the case L = Hy, suppose S € LU, \ L(W). By Lemma 6.6, 
alt(S) ¢ FO(L(W)). For some m, S € L(a™U) where L(U) = II,_1. By 
Lemma 6.7, alt(S) € L(A" *'vU) C Sy. The case L = I, is similar. 

Some care is needed in the case L = A, because A,, is not a logic of the 
form L(V), but is the intersection of two such logics, A, = UH, NI1,. Suppose 
An Z L(W) and let S € A, \ L(W). Again, alt(S) ¢ FO(L(W)) by Lemma 
6.6. For some m, S$ € L(A™U) A L(V™U), where L(U) = I,-1 as before. By 
Lemma 6.7, 


alt(S) € L(A" VU) N L(V ™t130) C Ap. 


(ii) follows by applying (i) to the prefix class @W. 4 


We do not know whether Lemma 6.7 holds for reach in place of alt, or whether 
there is a way to obtain Corollary 6.8 using reach instead of alt. 


It was shown in [MT97] that II, Z U,, UH, Z I,, and Ans: Z B(E,). 
Combining these results with Corollary 6.8, we get the following corollary which 
solves some open problems from [Mat98]. 


COROLLARY 6.9. In the monadic second order quantifier hierarchy, 

1) Tn Z FO(En). 

2) Sq Z FO(Hn). 

3) Ansi Z FOB(E,). 4 


§7. Conclusion and open problems. We introduced the operation exists, (5) 
saying “there are n components having S”, and used it to show that if a single 
new first order existential (or universal) quantifier strictly increases expressive 
power, then additional new first order quantifiers continue to strictly increase 
expressive power. This implies the strictness of all the natural quantifier hier- 
archies in the positive first order closure of a fragment of monadic second order 
logic. 
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An important problem is to find a similar operation for the second order 
(monadic) quantifiers. This could be used to get lower bounds on the expressive 
power of second order fragments, and perhaps to improve the strictness results 
on the monadic hierarchy and solve the strictness problem for the closed monadic 
hierarchy. 

We introduced an abstract concept of addition and multiplication of graphs. 
As an application we showed that for any logic L and any class of structures 
M for L, the set of sentences in L which are satisfiable in M is not expressible 
in the positive first order closure FO(L(W)) unless it is already expressible in 
L(W). 

We introduced another operation which converts a property S which is ex- 
pressible in L(V) but not in logic L(W) to a new property alt(S) which is ex- 
pressible in both L(AvV) and L(V 3V) but not in the positive first order closure 
FO(L(W)). This was applied to the monadic second order hierarchy, showing 
that II, Z FO(S,) and A,41 Z FOB(S,). 

A related problem is to find operations which convert a property not express- 
ible in L(W) to a property not expressible in L(AW) or L(VW). A solution 
to this problem could give upper bounds on expressive power, and answer some 
outstanding open questions in monadic second order logic. 
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